15 research outputs found

    A Speech Distortion and Interference Rejection Constraint Beamformer

    Signals captured by a set of microphones in a speech communication system are mixtures of desired and undesired signals and ambient noise. Existing beamformers can be divided into those that preserve and those that distort the desired signal. Beamformers that preserve the desired signal include the linearly constrained minimum variance (LCMV) beamformer, which ideally rejects the undesired signal and reduces the ambient noise power, and the minimum variance distortionless response (MVDR) beamformer, which reduces the interference-plus-noise power. The multichannel Wiener filter, on the other hand, reduces the interference-plus-noise power without preserving the desired signal. In this paper, a speech distortion and interference rejection constraint (SDIRC) beamformer is derived that minimizes the ambient noise power subject to specific constraints that allow a tradeoff between speech distortion and interference-plus-noise reduction on the one hand, and undesired signal and ambient noise reduction on the other hand. Closed-form expressions for the performance measures of the SDIRC beamformer are derived, together with its relations to the aforementioned beamformers. The performance evaluation demonstrates the tradeoffs that can be made using the SDIRC beamformer.
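    For orientation, the MVDR beamformer that the abstract uses as a reference point has the well-known closed form w = R⁻¹d / (dᴴR⁻¹d). The sketch below is an illustration of that classic baseline, not of the SDIRC beamformer itself (whose closed form and additional tradeoff constraints are given in the paper); the covariance and steering vector are toy values.

```python
import numpy as np

def mvdr_weights(R, d):
    """MVDR beamformer: minimize interference-plus-noise power w^H R w
    subject to the distortionless constraint w^H d = 1.
    R: interference-plus-noise covariance (M x M, Hermitian PD)
    d: steering vector of the desired source (M,)"""
    Rinv_d = np.linalg.solve(R, d)
    return Rinv_d / (d.conj() @ Rinv_d)

# Toy example: 4 microphones, a random Hermitian positive-definite covariance.
rng = np.random.default_rng(0)
M = 4
A = rng.standard_normal((M, M)) + 1j * rng.standard_normal((M, M))
R = A @ A.conj().T + M * np.eye(M)                  # Hermitian PD
d = np.exp(-1j * 2 * np.pi * np.arange(M) * 0.25)   # assumed steering vector
w = mvdr_weights(R, d)
# The distortionless constraint holds: w^H d = 1, so the desired signal
# passes through the beamformer undistorted.
print(abs(w.conj() @ d))
```

    The SDIRC beamformer replaces the single hard constraint above with tunable constraints, which is what enables the distortion-versus-rejection tradeoff the abstract describes.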

    DoA reliability for distributed acoustic tracking

    Distributed acoustic tracking estimates the trajectories of source positions using an acoustic sensor network. As it is often difficult to estimate the source-sensor range from individual nodes, the source positions have to be inferred from Direction of Arrival (DoA) estimates. Due to reverberation and noise, the sound field becomes increasingly diffuse with increasing source-sensor distance, leading to decreased DoA estimation accuracy. To distinguish between accurate and uncertain DoA estimates, this paper proposes to incorporate the Coherent-to-Diffuse Ratio as a measure of DoA reliability for single-source tracking. It is shown that the source positions can thus be probabilistically triangulated by exploiting the spatial diversity of all nodes.
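    A minimal sketch of the idea, under simplifying assumptions (this is reliability-weighted least-squares triangulation, not the paper's full probabilistic tracking): each node i at position p_i reports a unit bearing u_i, and a reliability weight w_i (in the paper, derived from the Coherent-to-Diffuse Ratio) down-weights uncertain DoAs when intersecting the bearing lines.

```python
import numpy as np

def weighted_triangulate(positions, doas, weights):
    """Solve sum_i w_i (I - u_i u_i^T)(x - p_i) = 0, the weighted
    least-squares intersection of bearing lines x = p_i + t * u_i."""
    A = np.zeros((2, 2))
    b = np.zeros(2)
    for p, u, w in zip(positions, doas, weights):
        P = np.eye(2) - np.outer(u, u)   # projector orthogonal to bearing u
        A += w * P
        b += w * P @ p
    return np.linalg.solve(A, b)

# Toy 2-D example: three nodes observe a source at (2, 1) with exact DoAs.
src = np.array([2.0, 1.0])
positions = [np.array([0.0, 0.0]), np.array([4.0, 0.0]), np.array([2.0, 4.0])]
doas = [(src - p) / np.linalg.norm(src - p) for p in positions]
weights = [1.0, 0.5, 0.8]   # hypothetical CDR-based reliabilities
est = weighted_triangulate(positions, doas, weights)
print(est)
```

    With noisy DoAs, the weights shift the estimate toward the bearings from nodes with a high Coherent-to-Diffuse Ratio, i.e. those closer to the source.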

    A reverberation-time-aware DNN approach leveraging spatial information for microphone array dereverberation

    Get PDF
    A reverberation-time-aware deep-neural-network (DNN)-based multi-channel speech dereverberation framework is proposed to handle a wide range of reverberation times (RT60s). There are three key steps in designing a robust system. First, to accomplish simultaneous speech dereverberation and beamforming, we propose a framework, namely DNNSpatial, that selectively concatenates log-power spectral (LPS) input features of reverberant speech from multiple microphones in an array and maps them to the expected output LPS features of anechoic reference speech using a single deep neural network (DNN). Next, the temporal auto-correlation function of received signals at different RT60s is investigated to show that RT60-dependent temporal-spatial contexts in feature selection are needed in the DNNSpatial training stage in order to optimize system performance in diverse reverberant environments. Finally, the RT60 is estimated to select the proper temporal and spatial contexts before feeding the log-power spectrum features to the trained DNNs for speech dereverberation. The experimental evidence gathered in this study indicates that the proposed framework outperforms the state-of-the-art signal processing dereverberation algorithm, weighted prediction error (WPE), as well as a conventional DNNSpatial system that does not take the reverberation time into account, even in extremely weak and severe reverberation conditions. The proposed technique generalizes well to unseen room sizes, array geometries, and loudspeaker positions, and is robust to reverberation time estimation error.
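    The input-feature step can be sketched as follows. This is an illustrative reconstruction, not the paper's exact pipeline: the STFT dimensions and the ±3-frame context are assumptions, and the per-channel LPS features are simply concatenated across channels and context frames into one DNN input vector per frame.

```python
import numpy as np

def lps(stft):
    """Log-power spectrum of a complex STFT; small floor avoids log(0)."""
    return np.log(np.abs(stft) ** 2 + 1e-10)

def dnn_input(multichannel_stft, context=3):
    """multichannel_stft: (channels, frames, bins) complex STFT.
    Returns a (frames, channels * (2*context + 1) * bins) feature matrix:
    for each frame, LPS features from all channels and +/-context
    neighboring frames are concatenated into one input vector."""
    C, T, F = multichannel_stft.shape
    feats = lps(multichannel_stft)                       # (C, T, F)
    padded = np.pad(feats, ((0, 0), (context, context), (0, 0)), mode="edge")
    rows = []
    for t in range(T):
        window = padded[:, t:t + 2 * context + 1, :]     # (C, 2c+1, F)
        rows.append(window.reshape(-1))
    return np.stack(rows)

# Toy shape check: 4 mics, 10 frames, 257 frequency bins, +/-3 frames context.
rng = np.random.default_rng(0)
X = rng.standard_normal((4, 10, 257)) + 1j * rng.standard_normal((4, 10, 257))
feat = dnn_input(X, context=3)
print(feat.shape)   # (10, 7196) = (frames, 4 * 7 * 257)
```

    The RT60-aware step of the paper amounts to choosing `context` (and which channels to include) as a function of the estimated reverberation time before the features reach the trained DNN.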